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Abstract. It has been known for almost 200 years that some angles cannot be 
trisected by straightedge and compass alone. This paper studies the set of such 
angles as well as its complement T, both regarded as subsets of the unit circle S 1 . 
It is easy to show that both are topologically dense in S 1 and that T is contained in 
the countable set A of all angles whose cosines (or, equivalently, sines) are algebraic 
numbers (Corollary 3.2). Thus, T is a very "thin" subset of S 1 . Pushing further 
in this direction, let if be a real algebraic number field, and let Tk denote the 
set of trisectable angles with cosines in K. We conjecture that the "computational 
density" of Tk in K is zero and prove this when K has degree < 2 (cf. §1.2 
and Theorem 4.1). In addition to some introductory field theory, the paper uses 
elementary counting arguments to generalize a theorem of Lehmer (Theorem 5.2)on 
the density of the set of relatively prime n-tuples of positive integers. 



1. Introduction 



In 1837, P.L. Wantzel proved that there exist angles that cannot be trisected by 
strict use of straightedge and compass alone |Wan] (see also, §1.1 below for further 
discussion). An easy observation based on his argument (Cor. 3.2) shows that the 
cosine of each trisectable angle must be an algebraic number, and thus the trisectable 
angles comprise merely a countable subset of the unit circle S 1 . This paper pushes 
further in the direction of showing how rare trisectable angles are. To this end, we 
let A C S 1 be the countable set of all angles with algebraic cosines and let T C A be 
the subset of trisectables. The main result of this paper (Theorem 1.1 or Theorem 
4.1) provides evidence for the conjecture that T is a very "thin" subset of A. 
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Statements of our results will be given at the end of this introduction, as will a 
description of how the remainder of the paper is organized. First, however, we present 
some context for the angle trisection problem. 

1.1. A thumbnail history of the angle trisection problem. The classical angle 
trisection problem, which originated with Greek mathematicians in the 5th century 
B.C.E., requires that for each angle a one find a geometric procedure that starts with 
a and ends with a/3. From that period on, numerous solutions to the problem have 
been given |Dud] , many of them very ingenious. None of these, however, make exclu- 
sive use of the simple straightedge and compass, requiring instead auxiliary devices: 
e.g., marks on the straightedge, special curves, such as the Quadratrix of Hippias 
and the Conchoid of Nicomedes [Dud], and other devices, such as the "Shoemaker's 
knife" [Hen]. 

A purist strain in Greek geometry, said to originate with Plato and encoded in 
the axiomatics of Euclid's Elements, devalued the use of such auxiliary methods in 
geometric constructions. Hence constructions using only compass and straightedge 
have been called Euclidean [Dud]. What has come to be called "the angle trisection 
problem" in subsequent times is the problem of finding, for each a, a Euclidean 
geometric construction that starts with a and produces a/3. 

A Euclidean geometric construction may be described more precisely as a finite 
sequence of steps starting with at least two distinct points in the plane such that 
each step is a construction of one of the following two types: (1) If two distinct points 
exist at a prior step, then the line they determine or the circle centered at one of 
them and passing through the other may be constructed. (2) If two lines, two circles, 
or a line and a circle exist at a prior step, then their points of intersection, if any, 
may be constructed. 

In this paper, all further references to constructions, trisections, trisectability, and 
the like, will assume the Euclidean restriction. 

After many failed efforts at solving the Euclidean angle trisection problem, it be- 
came widely believed, even in Euclid's time, that a solution was impossible. However, 
this belief was not supported by proof. 

Further progress on the Euclidean angle trisection problem was not achieved until 
the development of trigonometry and algebra by Arab mathematicians. Around 1430, 
the Arab mathematician Al-Kashi showed that the trisectability of a given angle a 
is related to the solvability of a certain cubic polynomial |Hog] . Presumably, this is 
essentially the polynomial q(x, b) described in the following paragraph, which derives 
from the trigonometric identity expressing sin(a) in terms of sin(a/3). In 1569, the 
Italian mathematician Rafael Bombelli independently demonstrated the same con- 
nection between trisectability and algebraic solvability |Bor] . However, with both Al 
Kashi and Bombelli, the concepts of "trisectability" and "solvability" remain unclear. 
Neither mathematician gives an algebraic criterion for Euclidean constructibility. Nor 
does either specify what kind of algebraic solutions he has in mind. Finally, neither 
makes a claim to having proved the impossibility of solving the trisection problem. 
Nevertheless, despite these gaps, the connection these mathematicians obtain between 
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the purely geometric problem and an algebraic one was a major breakthrough that 
foreshadowed the ultimate solution 

This solution was finally and decisively achieved by the French mathematician 
Pierre-Laurent Wantzel (1814-1848) [Wan], as stated at the outset of this paper. By 
that time, the connection between trisectability and properties of the cubic polynomial 
q(x, b) = 4x 3 — 3x+b must have been well known, because Wantzel uses the polynomial 
without further comment. The parameter b represents the sine of a given angle a, 
and one of the roots of q{x,b) is sin(a/3). Wantzel demonstrates that constructible 
quantities must be zeros of irreducible polynomials of degree a power of two, and he 
uses this criterion to deduce that a is trisectable if and only if q(x, b) is reducible over 
the field Q(b). (To be sure, he does not use the language of fields, since the concept of 
a field was not fully developed until the 1850 's.) Since there are many b in the interval 
[— 1, 1] for which q(x,b) is not reducible over Q(&), there are, correspondingly, many 
angles for which there is no Euclidean trisection procedure (cf. Corollary 2.2). 



1.2. Some terminology, statements of results and organization of the paper. 

For technical convenience in our work and in the statements of results, we shall 
replace sin(a) by 2cos(a). This amounts to replacing the polynomial q{x,b) by the 
polynomial p(x, a) = x 3 — 3x — a, which we use throughout the rest of this paper. 
Here the parameter a represents 2cos(a;). Wantzel's argument applies equally well 
to p(x, a), so we can rephrase his result as: a is trisectable if and only if p(x,a) is 
reducible over the field Q(a). 

Next, we normalize our discussion by using the Cartesian plane M 2 , with its usual 
terminology and notation. Points in IR 2 will be called constructible if they can be 
obtained from the set of points {(0, 0), (1, 0)} by a Euclidean construction. More 
generally, given any set 5* C R 2 that contains {(0, 0), (1, 0)}, any point for which 
there is a construction starting with points in S is said to be constructible over S. 
The set of all such points will be denoted by C(S). An angle a is identified in the usual 
way with the point (cos(a), sin(ot)) on the unit circle S 1 , which allows us to talk about 
constructible angles, a is said to be trisectable if a/3 G C({(0, 0), (l,0),a})Q. Since 
this last relation bears no obvious connection to the relation a G C({(0,0), (1,0)}), 
there is no reason to suppose that a trisectable angle need be constructible. 

Some examples are in order. First note that the following angles are constructible: 
a = 7r/3; (3 n = 7i/2 n ; ^ n = n/3 ■ 2 n ; e n = 7r/2 n + n/3. The reasons are: a is the angle 
in an equilateral triangle; (3 n can be obtained from tc, which is obviously constructible, 
by repeated bisection; 7„ can be obtained from a by repeated bisection; e n = (3 n + a. 
Here, n is any non-negative integer. 



"The notation "a/3" here represents 1/3 of a in angular measure and should not be confused with 
scalar multiplication of the point a by the scalar 1/3. A similar caveat applies to the examples of 
angles that we give later. 
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Now, a = 7r/3 is the most commonly presented example of a non-trisectable angle: 
for 2cos(7r/3) = 1, and p(x, 1) is easily shown to be irreducible. Therefore, we may 
conclude that e n is not trisectable: for if it were, then starting with e n we could 
construct e n /3 = 7 n + a/3, and from that we could subtract j n , obtaining a/3. Since 
e n is constructible, we could conclude that a/3 is constructible, hence, a fortiori 
constructible over {(0, 0), (1, 0), a}, which was just shown to be impossible. 

By the same arguments, we could conclude that all angles of the form k(3 n are tri- 
sectable, k an arbitrary integer, and all angles of the form kf3 n + a are non-trisectable. 
Both these sets are countable, dense subsets of the unit circle. Of course, all of these 
examples are constructible. 

At this point it becomes convenient to focus our attention away from the angles 
themselves and toward their cosines. Consider any number a £ [—2,2]. We call a 
a trisection number if there is a trisectable angle a such that a = 2cos(a), and we 
denote the set of all trisection numbers by Tri. As we show in Corollary 3.2 (and 
have already mentioned earlier), Tri C A, where A is the set of algebraic numbers. 
For any real algebraic number field K, we define the density of TridK in [—2, 2] C\K 
in §4, and we denote it by 8k {Tri). 

Conjecture 1. 8k (Tri) = 0. 

Theorem 1.1 (Main result). Conjecture^ is true when K is the field of rational 
numbers Q or a real quadratic field. 

Now we present a selection of other results in the paper which may be of indepen- 
dent interest. 

The following three results in §§3.2, 3.3 give further examples of non-trisectable 
angles : 

Corollary 3.2 a is non-trisectable whenever cos(a) is transcendental. Therefore, Tri 
is countable and the set of non-trisectable angles is uncountable. 

Proposition 3.2 If a is a non-zero square in Q, then a Tri. 

Proposition 3.3 Let r and s be any non-zero integers prime to each other and to 
3. Then p(x , 3r / s) is irreducible over Q. Hence, 3r/s £" Tri . 

Lest the reader be left wondering whether there are any non-trisectable angles with 
irrational algebraic cosines, we prove the following result in Appendix C about the 
non-trisectable angles tt/3 + 7r/2 n : 

Theorem The numbers cos(7r/3 + 7r/2 n ) are algebraic of degree 2". 

Proposition 3.4, together with its addendum, implies that there exist a countable 
number of trisectable angles that are not constructible. 
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Theorem 4.1 Let K be a real number field of degree k < 2. Then 

8 K (R) is £>(iH (fc+1) ). 



Here Sk{R) counts the number of trisection numbers in K that have height < R 
and divides this by the number of members of [—2, 2] D K of height < R. See §4 for 
the definition of height and further notation. Using the usual definition of the big 
"(9" notation, it is clear that Theorem 4.1 implies Theorem 1.1. 

In § 5, we estimate the number of relatively prime positive integer /c-tuples (ai, . . . , a^) 
satisfying a« < rij, where n = (m,...,nfc) is a fc-tuple of positive real numbers 
(Theorem 5.2). This generalizes a theorem of Lehmer and Sittinger [SitJ. A con- 
sequence of this theorem is an asymptotic relation, which is easier to state than the 
theorem itself: 

rii • . . . • n k 



\C(k,n) 



The term on the left denotes the number of integer fc-tuples (ai, . . . , a&) being counted 
and £ denotes the Riemann zeta function. See § 5 for further definitions. 

Finally, in Appendix A, we prove some results about n-section of angles, by which 
we mean a Euclidean construction that starts with an angle a and produces the angle 
a/n. 

Theorem A.l Suppose n is a positive integer such that for any given angle a, there 
is a Euclidean construction that starts with a and ends with a/n. Then, n has the 
form 2 k , for some non-negative integer k. 

Therefore, for any n not a power of 2, there exist non-n-sectable angles. But can 
we assert, as in the case of non-trisectable angles, that, for fixed n (not a power of 
2), the set of all non-n-sectable angles is dense in S 1 , or similarly for the set of all 
n-sectable angles? 

In general, this may not be so easy. However, we can assert this to be the case for 
special n: 

Propositions A.l and A. 2: If n is an odd positive integer such that 27r/n can be 
constructed (i.e., the regular polygon of n sides can be constructed), then there exists 
a countable dense subset of S 1 consisting of n-sectable angles and a countable dense 
subset of S l consisting of non-n-sectable angles. 

As is well known, Gauss showed that 2tt /n can be constructed provided <f)(n) is a 
power of 2. Here, denotes the Euler phi function. Among odd integers n for which 
Gauss's condition holds are the integers n = 3,5, 17, 257, 65537. 

All of the angles in these two propositions have cosines that are algebraic numbers. 
The transcendental case is covered by the following theorem in Appendix A, which 
extends Corollaries 13. II and 13.21 of the main text. 
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Theorem A. 2 Suppose that cos(a) is transcendental and that n is a positive integer 
that is not a power of 2. Then a is not n-sectable. Therefore, the set of all non-n- 
sectable angles is uncountable, and the set of all n-sectable angles is countable. 



The paper has three main parts. First, Sections 1-3 consist of background and 
several results that produce classes of examples of both trisectable and non-trisectable 
angles. Second, Sections 4-7 prove our main result on density (the precise version 
of which is Theorem 4.1). The proof involves a variety of counting and estimation 
arguments, including a generalization of a theorem of Lehmer. Third, the Appendix 
contains miscellaneous supplementary results. One section (Appendix B) shows that 
our main result is independent of choice of basis. Another section (Appendix A) 
extends some of the results of Sections 2 and 3 to the case of n-section of angles. And, 
finally, Appendix C shows that the algebraic number cos(7r/3 + vr/2" - ) has degree 2 n . 

At this point, I wish to thank Ravi Ramakrishna for a number of helpful conversa- 
tions and suggestions. I also wish to thank George Wilson and Michael Nussbaum for 
their help with the Italian article on Bombelli [Bor] . Michael Nussbaum's assistance 
was particularly helpful in enabling me to assess Bombelli's contribution to the angle 
trisection problem. Finally, I wish to thank Benjamin Kahn and Kay Wagner for 
some interesting questions. These are answered in Theorems A.l and A. 2. 
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2. Basic facts about constructibility and trisectability 

Most of the results described in subsections 2.1 and 2.2 are either well known or 
easily derivable. They are presented here as background for the reader. 
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2.1. Constructible points and numbers. The basic facts about constructible 
points and numbers are carefully described in various classical texts (e.g., |Waej . 
[Cou] ). so we give only a brief outline here. 

Let S be a subset of R 2 . We have already defined the set of points C(S) con- 
structible over S in §1.2. We shall now identify in the usual way the field R of real 
numbers with the set of all points in R 2 of the form (x, 0). If S C R, then we call the 
elements of C^(S) = C(S) flR numbers constructible over S (or simply constructible 
numbers when S = {(1, 0), (0, 1)}). 

Because the four elementary operations of arithmetic can be realized by Euclidean 
constructions, it follows that the sets C(S) are closed under these operations, so 
these sets are subfields of R. Furthermore, it follows directly from definitions that 
Cr(C-r(S)) = Cr(S'). Thus, we lose no generality by assuming that S itself is already 
a subfield of R. The further equality Cr(S) x Cr(5) = C(S) shows that we lose no 
information about points constructible over S by focusing on numbers constructible 
over S. 

The field C«(Q) is called the field of constructible numbers, and its subfields are 
called constructible fields. It follows from our comments above that if K is a con- 
structible field, then C R (Q) = C R (K). 

Now consider some Euclidean construction over S. By elementary plane geometry, 
the coordinates of each newly constructed point are zeros of polynomial equations of 
degree at most two, and the coefficients in each such equation are rational functions of 
the coordinates of points that have already been constructed over S. This description 
leads immediately to the following fundamental algebraic fact about constructible 
numbers: 

Theorem 2.1. Let K be a subfield o/R. A real number x belongs to C^(K) (i.e., 
is constructible over K) if and only if there is a finite tower of real, quadratic field 
extensions 

K = K GK 1 C...CK n , 

such that x e K n . 

It follows immediately that Cr(Q) is a real subfield of the field A of algebraic 
numbers. 

Suppose that the real number b is constructible over the real field K and that 
K = Kq C . . . C K n is a tower as above with b G K n . Then, by Theorem I2.1[ the 
minimal polynomial of b over K must have degree of the form 2 k for some k < n. 
When K = Q, this is called the degree ofb. Therefore, every b e Cr(Q) has degree a 
power of 2. 

2.2. Trisectable angles. We recall that the angle a is trisectable if the angle a/3 is 
constructible over the set {(0, 0), (1, 0), a} 

We set a = 2 cos(a). It is easy to see that a/3 is constructible over {(0, 0), (1, 0), a} 
if and only if cos(a/3) is constructible over the field Q(cos(a)) or, equivalently, 
2cos(o;/3) is constructible over the field Q(2cos(o;)) = Q(a). This second formu- 
lation is often slightly more convenient for our algebraic computations. We use either 
formulation without further comment. 
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It is possible for the angle a to be constructible without being trisectable (e.g., 7r/3, 
as mentioned in the introduction) and to be trisectable without being constructible. 
We give examples of the latter in §3.4. 

We now invoke a standard trigonometric identity to relate the quantities 2 cos(a) 
and 2 cos(a/3): 

(1) 2 cos(a) = (2 cos(a/3)) 3 - 3(2 cos(a/3)). 

That is, using a = 2cos(a), as above, 2cos(a/3) is a zero of the monic polynomial 

p(x, a) = x 3 — 3x — a G Q(a) [x] . 

Theorem 2.2. The angle a is trisectable if and only if p(x,a) is reducible over the 
field Q(a), where a = 2cos(a). 

As indicated in our introduction, an equivalent fact was demonstrated by Wantzel. 
We provide a modern version of his proof here for the reader's convenience. 

Proof. <=: Assume that 2cos(a/3) is constructible over Q(cos(a)) = Q(a). Then , 
by Theorem 1, the minimal polynomial / of 2cos(a/3) over Q(a) has degree 2 n , for 
some integer n. Thus, the degree of / is not equal to or 3. Since / divides p(x, a), 
p(x,a) is reducible in Q(a)[x]. 

=>: If p(x,a) is reducible over Q(a), it factors as the product of a linear term and 
a quadratic term in Q(a)[x]. Since 2cos(a;/3) is a zero of p(x,a), by the foregoing 
trig identity, it must be a zero of one of the factors. Therefore, in either case, it is 
constructible over Q(a), by Theorem 1, and so a is trisectable. □ 



Corollary 2.3. 7r/3 is not trisectable, which means that it is impossible to find a 
Euclidean trisection construction for each angle. 

Proof. When a = n/3, we have 2cos(a) = 1, so that p(x,a) = x 3 — 3x — 1. The 
reader can easily check by a direct computational argument that this polynomial is 
irreducible over Q(l) = Q. Alternatively, we give an argument that uses Eisenstein's 
criterion |Wae] . The polynomial f(x) = p(x — 1, 1) = a; 3 + 3x 2 — 3 satisfies the 
conditions of Eisenstein's Theorem. Therefore, f(x) is irreducible. It follows that 
p(x, 1) = f(x + 1) is irreducible. Now apply the preceding theorem. □ 



3. Examples of trisectable and non-trisectable angles 

3.1. Multiples of it. As we already indicated in the introduction, every integral 
multiple of ir/2 n is both trisectable and constructible and these form a countable, 
dense subset of S 1 . Indeed, it is easy to see that the set of all angles that are both 
trisectable and constructible form a dense subgroup of S 1 under angle addition. It 
follows from Corollary 13.21 that this subgroup is countable. We do not make use of 
the group structure in this paper. 
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R. C. Yates |Yat] proves the following easy generalization of the fact that the integer 
multiples of 71-/2™ are trisectable. 

Proposition 3.1. Suppose that the integer k is not a multiple of 3. Then every 
multiple ofir/k is trisectable. 

Proof. Since k is relatively prime to 3, there are integers a and b such that 3a + bk = 1. 
Multiply both sides of this equation by n/3k: a(ir/k) + bir/3 = it/ 3k. Since n/k is 
given and 7r/3 is constructible, it follows that Tc/3k can be constructed from 7r//cjj □ 

Suppose now that a is a constructible angle that is trisectable and (3 is a non- 
trisectable angle. Then a + (3 cannot be trisectable. For if it were, then, starting with 
f3, we could construct a + (3, hence (a + (3)/3, and from that, (3/3, contradicting the 
non-trisectability of (3. 

Since we have shown above that there is a countable dense subset of S 1 consisting 
of angles that are constructible and trisectable, and since we have seen that non- 
trisectable angles exist, this shows that the set of non-trisectable angles contains a 
countable subset that is dense in S 1 . Shortly, we demonstrate a much stronger result. 

3.2. Trisection numbers and the countability of the set of trisectable angles. 

We give further examples of non-trisectable angles. First, we restate the definition of 
a trisection number for emphasis. 

Definition 1. A trisection number is any real number of the form 2cos(a), for some 
trisectable angle a. We denote the set of all trisection numbers Tri. 

We begin with the following lemma. 

Lemma 3.1. Let t be an indeterminate. The polynomial p(x,t) G Q(t)[x] is irre- 
ducible. 

Proof. Suppose the conclusion is false. Then p(x, t) has a zero in Q(t), which we may 
write as A/B, where A and B are relatively prime polynomials in Q[t]. The equation 
p(A/B,t) = implies that A(A 2 - 3B 2 ) = tB 3 , and this, in turn implies that B\A 2 , 
which is impossible unless B is a unit in Q[t], i.e., a non-zero constant polynomial, 
which we may absorb in A. Therefore, p(A,t) = 0, which implies A ^ and A\t. So, 
either there is a non-zero rational number c such that A = c or there is a non-zero 
c such that A = ct. Either choice gives a non-trivial algebraic relation satisfied by t 
over Q , p(A, t) = 0, which is impossible. □ 

See the proof of Theorem A. 2 in Appendix A for an alternative proof of this lemma 
that uses Eisenstein's criterion. 

Since Q(a) = Q(£) when a is transcendental, the following corollary is immediate: 
Corollary 3.1. p{x, a) is irreducible overQ(a), for every transcendental a. 



tYates gives a faulty proof of the inverse of this proposition. He succeeds only in proving that if k 
is a multiple of 3 and if n/k is constructible, then it is not trisectable 
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Corollary 3.2. a is non-trisectable whenever cos(a) is transcendental. Therefore, 
Tri is countable and the number of non-trisectable angles is uncountable. 

Proof. Let a = 2cos(a) and b = 2cos(a/3). We have shown that the following 
assertions are equivalent: a is non-trisectable; b is not constructible over Q(a); p(x, a) 
is irreducible over Q(a). Thus, by Corollary 13. 1\ when a is transcendental, a is not 
trisectable. Therefore, Tri is a subset of A n R, and so it is countable. Since a t— > a 
is at most 2 — 1 and is onto [—2, 2], the set of non-trisectable a is uncountable. □ 

This corollary shows that to find a trisectable angle a (or rather, to find the cor- 
responding number 2cos(a) = a), we can require, without loss of generality, that we 
search for a among the algebraic numbers. Therefore, we assume throughout the rest 
of this paper that a is algebraic. 

In Appendix A, we generalize Corollary 13.21 to the case of n-sectable and non-rz- 
sectable angles, for every n that is not a power of 2. 

3.3. Non-trisectable angles with rational cosines. We now focus on the numbers 
a = 2 cos(a) to describe some examples of non-trisectable angles. The next two 
results display countably many examples of rational a G [—2, 2] that are not trisection 
numbers. Contrast these with the non-trisectable angles f + jpr, whose cosines are 
constructible numbers of arbitrarily high degree over Q (cf. Appendix C). 

Proposition 3.2. If a is a non-zero square in Q D [—2,2] then a (jL Tri. 

Proof. Let E be the projective elliptic curve whose affine equation is y 2 = x 3 — 3x. It 
is well known that the set of rational points on E form a finitely-generated abelian 
group of rank and torsion group Z 2 (e.g., see |Hus] . pp. 33-35). That is, the only 
rational points on E are [0, 0, 1] and [0, 1, 0], the point at infinity. It follows that there 
is no non-zero rational c such that c 2 = x 3 — 3x has a rational solution. So, p(x, c 2 ) is 
irreducible over Q = Q(c 2 ), for all non-zero, rational c. The result now follows from 
Theorem 2 by restricting to non-zero, rational c such that c 2 < 2. □ 

Proposition 3.3. Let r and s be any non-zero integers prime to each other and to 3. 
Then p(x , 3r / s) is irreducible over Q. Hence, no such 3r/s in [—2,2] belongs to Tri. 

Proof. The irreducibility of p{x,3r/s) is an immediate consequence of the Eisenstein 
Criterion and the Gauss Lemma. □ 

Remarks: a) Any real number field K (i.e., subfield of R of finite degree over Q) 
whose integers admit unique factorization can be used in place of Q in this proposition, 
provided 3 is a prime in the ring of integers of K. 

b) The foregoing results show that both K fl Tri and (K fl [—2, 2]) \ (K R Tri) are 
big subsets of K fl [—2, 2] for many real number fields K. In the next few sections, 
we show in a number of cases that K fl Tri is much the smaller of the two. 
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3.4. Non-constructible trisection numbers. We conclude with a family of exam- 
ples of real number fields containing countably many trisection numbers that are not 
constructible. We remind the reader that these correspond to trisectable angles that 
are not constructible. 

First, it will be convenient to make use of the polynomial function / : R — > R given 
by the equation f(x) = x 3 — 3x, so that, for each real x and a, p(x, a) = f(x) — a. 
Set y = f(x). It is easy to check that if — 2 < x < 2 (resp, x < —2, x > 2), 
then —2<y<2 (resp., y < —2, y > 2). It follows immediately that /([— 2,2]) = 
[-2,2] (resp., f'\[-2,2}) = [-2,2] , so that f{3 n [-2,2]) = f(S) n [-2,2] (resp., 
f-\S n [-2, 2]) = f-\S) n [-2, 2]) for every S. The following lemma will be useful 
for our later computations. We leave the easy verification to the reader. 

Lemma 3.2. Let K be any subfield of R. Then Tri n K C f(K n [-2, 2]). 

The following proposition now forms the basis for the examples just mentioned. 

Proposition 3.4. Let K be a finite, constructible extension ofQ, and let F be a real, 
finite extension of K such that [F : K] is prime to 2 and 3. Then: (a) Tri fl F = 
f(F n [—2, 2]); (b) No element of f(F \ K) is constructible. In particular, when 
F 7^ K , f(F\ K) contains a countable infinity of trisection numbers that are not 
constructible. 

Proof, (a) By the lemma, it suffices to prove that TridF D f(Fn [— 2, 2]). So, choose 
any a G f(F fl [-2,2]) = f(F) n [-2,2]. That is, a is in [-2,2] and is of the form 
a = b 3 — 36, for some b G F. The degree dofb over K(a) divides [K(b) : K], which, in 
turn, divides the odd number [F : K]. Since b is a zero of p(x, a) G K(a)[x], d cannot 
be greater than 3. Since it is odd and prime to 3, it must equal 1, so K(a) = K(b). By 
construction, p(x,a) is reducible over K(b), so it is reducible over K(a). Now, since 
a G [—2,2], we may write a as a = 2cos(a), for some angle a. Then 2cos(a/3) is a 
zero of p(x, a) by the definition of p(x, a). We now argue as in the proof of Theorem 
12.21 Since p(x, a) factors into the product of a linear and a quadratic polynomial 
over K(a), and since 2cos(a/3) is a zero of one of these, Theorem 12.11 implies that 
2cos(a/3) is constructible over K(a). 

Now let Q = L < L\ < . . . < L n = K be a tower of real quadratic extensions, 
which exists by the hypothesis on K. Then, Q(a) — Lo(a) < ... < L n (a) = K(a) is 
a tower of real field extensions, each at most quadratic. It follows that 2cos(a/3) is 
constructible over Q(a), which shows that a is trisectable. 

Therefore, a = 2 cos(a) G Tri fl F, as required. 

(b) Now suppose that b G F \ K, and set a = f(b), as above. The argument in 
(a) shows that K(a) = K(b), so, in particular a ^ K. Therefore [K(a) : K] ^ 1. 
Further, [K(a) : K] is odd, since it divides [F : K\. Therefore, by Theorem 12. 1[ a is 
not constructible over K. So it is not constructible over Q. Since F fl [—2, 2] \ K is 
infinite, and / is at most three to one, f(FD [—2, 2] \K) is an infinite set of trisection 
numbers (by part (a)) none of which is constructible. 

□ 
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Addendum to Proposition 13. 4t Let K be any real number field and m any pos- 
itive integer. There exists an extension F of K such that [F : K] = m. When m is 
odd, we may choose the extension to be real. 

Therefore, there exist many instances of constructive fields K and extensions F of 
K as described by Proposition 13.41 . i.e., the proposition is not vacuous. 

We thank Ravi Ramakrishna for suggesting the following proof of the addendum, 
which we give in three steps. 

Step 1: Let R be the ring of integers of K, and choose any proper prime ideal 
p C R. Let 5* be the localization R p of R at p, and let q be the extension of p to S. 

Step 2: q is the unique maximal ideal of the local ring S. The non-zero ideals of 
S are precisely the non-negative powers of q, all of which are distinct. Choose any 
q G q \ q 2 . Since 5* is a Dedekind domain, the ideal (g) equals a unique non- negative 
power of q, which must be the first power. Therefore, S is a principal ideal domain. 

Step 3: By Step 2, we may apply Eisenstein's criterion to the polynomial x m — q G 
S[x], concluding that it is irreducible. Since K is the field of fractions of 5", Gauss's 
Lemma implies that x m — q is irreducible in K[x\. Letting c be any zero of x m — q 
(real, if m is odd), the field F = A(c) satisfies the desired condition. 

4. The density of Kn Tri in K n [-2,2]: preliminaries and an overview 

We have seen that the set Tri of trisection numbers consists of real algebraic num- 
bers in [—2, 2], i.e., Tri C An Kfl [— 2, 2]. As a step toward getting more information 
about the global structure of Tri, we specialize to a number field K C Afll, and we 
attempt to compute the density of Tri D K in K D [—2, 2]. 

4.1. Height and density. One way to define density in this context is to make use 
of a so-called height function 

h K : K — >■ (0, oo). 

The definition of hx that we have in mind is a simplified version of what is used in 
Diophantine Geometry (cf. |Lanj ). We begin by choosing a fixed Q- vector space basis 
V = {vt,v 2 , ...v k } of K. 

Lemma 4.1. Every x G K can be written uniquely as 

(2) x = (aivi + . . . + a k v k )/b, 

for integers a±, . . . ,a k ,b satisfying 

(a) b > and 

(b) di, . . . dfc, b have no prime factors in common. 

We leave the proof to the reader. 
Then, using (T5]), we define 

(3) h K (x) = max{\ai\, . . . ,\ak\,b}. 
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For any real, positive R, the set 

B K (R) d M h K \0,R] 

is finite, and so, its cardinality \Bk(R)\ is a non-negative integer. Clearly, if 1Z is any 
unbounded subset of (0, oo), then 

|J B K (R)=K. 
Ren 

For sufficiently large R, the density 6k(R) of Tri in B K (R) D [—2,2] is defined to 
be the ratio 

m x (m \TrznB K (R)n[-2,2}\ 

Alternatively, it might be called the relative frequency of occurrence of elements of 
Tri in Bk(R) H [—2, 2]. If the limit lim^oo 5k (R) exists, we call it the density of Tri 
in K fl [—2, 2], and we denote it by Sk( Tri). It can be viewed as the probability that 
a randomly selected element of K fl [—2, 2] belongs to Tri. We now 



Conjecture 1 (Main Conjecture) 5x(Tri) = 0. 



Note that the definitions of h K and B K (R) depend on the choice of V. And so our 
density function depends on this choice. It is not hard, however, to show that the 
height function corresponding to another choice of basis will be commensurate to the 
first. This enables us to show that if Conjecture 1 holds for one choice of basis, it will 
hold for any other. We present some details of this discussion in Appendix B. Here 
we simply proceed with the definitions arising from a fixed V. 

Next, we wish to describe our computational strategy for estimating the densities 
(jlj). This will make use of some standard "estimation language," which we briefly 
spell out for the reader's convenience. 

4.2. Estimation. We are interested in estimating values of real-valued functions as 
the arguments get large. Usually this is done for functions with some standard do- 
main, such as the real numbers or the integers. However, we need to look at a broader 
class of domains. Accordingly, we let X be a locally-compact Hausdorff space with 
countable basis, and we let T denote the set of (not necessarily continuous) real- valued 
functions /onl such that / _1 (0) has compact closure. 

Let f,g G J 7 , with g > outside some compact set. Then / is said to be 0(g) if 
the ratios \f(x)\/g(x) are defined and bounded for all x outside some compact set. 
If fi is also in J 7 , such that / — /i is O(g), then we may express this by writing 
f = f 1 + 0(g). 

This notation has a number of simple consequences. For example: 

(a) If / is 0(g), /i, </i G J 7 , and if |/i| < |/| and g < g u then f x is 0(g x ). 

(b) If fi is O(gi), i = 1, 2, . . . , m, then /i • f 2 • . . . ■ f m is 0(g x ■ g 2 • . . . ■ g m ). 

(c) If fi and gi are as in (b), and if ci, c 2 , ■ ■ ■ , c m are real numbers that are not all 
zero, then E^c^ is 0(Y? =1 \ Ci \ gi ). 
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Although the "big 0" notation gives only a very crude connection between the 
values f(x) and g(x) as x gets large in X, even this can sometimes be useful. For 
example, let us write lim^oo f(x) = L, for some real number L, if, for each positive 
integer n, there is a compact subset C n of X such that the relation \f(x)—L\ e [0, 1/n) 
holds outside of C n . Now suppose that / is 0(g) and lim^oo g(x) = 0. It then follows 
that Hindoo f(x) = 0. 

A more refined estimation relation, namely that of asymptotic approximation, may 
be defined as follows. Let / and g belong to J 7 . Then f(x)/g(x) is defined outside 
some compact set, and the expression lim x ^. 00 (f(x)/g(x)) makes sense. We say that 
/ is asymptotic to g, written / ~ g, provided that lim^oo f(x)/g(x) = 1. It is easy 
to check that ~ defines an equivalence relation on J 7 . 

Assuming additionally that g > outside some compact set, it is easy to check 
that / ~ g implies that / is 0(g). So ~ is a finer relation than big O. 

However, big O can be used to obtain ~ under some circumstances. Namely, choose 
any /,g 6 J, with g > outside some compact set, and let fi = f + 0(g). Assume 
that \im x ^ 00 (g(x)/ f(x)) = 0. It then follows that there exist a positive constant M 
and a compact set C C X such that 

-Mg/f < 1 - h/f < Mg/f 

outside C. This implies, first, that fi 6 T and, second, that fi ~ /. 

4.3. The Computational Strategy. Although the steps that we use in working 
toward a proof of Conjecture 1 are mostly of an elementary computational nature, 
the overall structure of the argument is intricate, and so we give here a brief overview. 

4.3.1. The numerator \Tri (1 Bk{R) H [— 2,2]| of (0). The set Tri appearing in the 
numerator of (jl]) is not computationally easy to work with, so we replace it by a 
larger set that is more computationally amenable. In particular, we use the function 
/ : R — > R of §4 — f(x) = x 3 — 3x — and Lemma 1 of §4 to conclude that 

Tri n B K (R) n [-2, 2] C f(K) n B K (R). 

Therefore, 



(5) 5 K (R)< I**)" 



\B K (R)n [-2,2]!' 

We shall show that the right hand side of inequality (E$ goes to zero as R — > oo, 
so we do not lose anything by this replacement in our effort to prove Conjecture 1 . 

4.3.2. The key computations for the numerator of (GJ). We see from equation (J2]) that 
K can be identified with the set of all integer (k + l)-tuples (ai, . . . , a^, b) such that 
b > and ax, ... ,ak,b are relatively prime. It is not hard to rewrite / using this 
identification. Bk{R) has a simple description in this notation, but a description of 
the intersection f(K)nB K (R) involves k+1 polynomial inequalities in ai, a 2 , . . . , a^, b, 
and so it is fairly complex. 
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When k is small, specifically, when k < 2, however, we are able to use the inequal- 
ities to find a function S : (0, oo) — > (0, oo) such that 

(6) S is 0(R 1/3 ) 

and 

(T) f(K)nB K (R)Cf(B K (S)). 

The proof of ([7]) follows a similar pattern for each number field K but details and 
specific bounds depend on the degree and discriminant of K. These proofs will occupy 
all of §7. 

Using the inequality for the density above, we then get 



S K (R) < 



< 



\f(B K (S))\ 
\B K {R) n [-2, 2] | 

\B K (S)\ 
\B K (R) n[-2,2}[ 



4.3.3. The denominator \Bx{R) H [— 2,2]]. The description of Bk{R) leads immedi- 
ately to an estimate for \B K (R)\. Indeed, as we shall see in §6.1, 

where ( is the classical Riemann zeta function. This will follow by generalizing an 
argument of Sittinger that proves a theorem of Lehmer (cf. [Sit J and §5). This 
asymptotic relation implies that 

o» \b.w\ < 

for sufficiently large R. 

However, even though the subset B K (R) D [—2, 2] is easy to describe in terms of 
ax, . . . , a,k, b and the basis V, its cardinality cannot be easily estimated. For example, 
we do not have good information about how the relatively prime, positive integer 
k + 1-tuples are distributed throughout the subset N fc+1 C M fc+1 , where N is the set 
of natural numbers, so we cannot proceed via some sort of volume computation. 

However, since our goal is a very crude estimation, we can circumvent this problem 
by defining a (relatively small) subset 



(10) Q(R)CB K (R)n[-2,2], 
for which we can prove that 

(11) \Q(R)\ 



(fc + l)^ 1 ||V||C(A;+l)- 



16 PETER J. KAHN 

Here ||V|| is a positive constant depending only on the basis V — in particular, not 
on R. This will occupy §6.2. It follows from (fTTj) that 

(12) lQm - wTww\m+Ty 

for sufficiently large R. 

Therefore, using ©, ©, (JTHJ, and (JT2J, we have 

(2S) k+1 (k + l) k+1 \\V\\ 

k+1 

(13) = 4||V| 



< 



(k + 1)5 



i? 



Since 5 is ^(i? 1 / 3 ), by 



(14) is °( R ~ Uk+1) )i 
which implies the following result. 

Theorem 4.1. Let K be a real number field of degree k < 2. Then, 

(15) ™ 0(R-i {k+l) ). 

This clearly implies Theorem 1.1. We state its extension to all real number fields: 
Conjecture 2. : S K (R) is 0{Rs^) } for any real, degree k extension KofQ. 

We now begin the proofs of the above results. 

5. A GENERALIZATION OF LEHMER'S THEOREM 

Choose integers k and n, with k > 2 and n > 1, and let Q(k,n) be the set of 
all relatively prime /c-tuples of positive integers < n. As before, we make use of the 
Riemann zeta function (. 



Theorem 5.1 (D. Lehmer, 1900). Fix the integer k. Then 
In more recent work [Sit] . B.D. Sittinger has shown that 

h 

n 

(16) \Q(k,n)\ = —r + 0(f k (n)), 

where fk{n) = nln(n), when k = 2, and fk{n) = n fc_1 , when k > 2. Equation (|T6l) 
immediately implies Theorem 15.11 (cf. §4.2). 
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We will generalize ( 1T61) in two ways. First, we replace n by a fc-tuple n = (m, • • • , nk), 
and second, we allow each rij to be an arbitrary real number > 1. Let Q(k,n) be 
the set of all relatively prime /c-tuples of positive integers (ai, . . . , a k ) such that each 
a>i where n satisfies the conditions just given. 

We think of Q(k, n) as a generalized "cube" with sides of length n*. The set Q(k, n) 
appearing in ( fl6|) represents the case in which all side-lengths equal a given positive 
integer n. 

It will now be convenient to introduce the notion of eccentricity of Q(k,n). We 
define this as follows: 



(n(1 \ \ max{ni,...,n fe } 

(17) e(Q(fc,n)) = — f . 

minjni, . . .,n k } 

Clearly, e(Q(k,n)) > 1, with equality holding if and only if all rii are equal. For 
any real number E > 1, let C% denote the set of all n e M. k such that each rii > 1 
and e(Q(k,n)) < E. This set inherits a locally-compact topology from ~R k ; we may 
call it the space of k-cubes of eccentricity < E. 

Next, we let 7(n) denote the geometric mean of the the n, comprising n, i.e., 

(18) 7 (n) = (n 1 -...-n fc ) 1 / fc . 
We use this to define a function /^(n) as follows: 



(19) / fc (n) 



7(n)ln(7(n)) : k = 2 
7(n) /c - 1 : jfe > 2. 



Clearly this gives one reasonable way to generalize the definition of the same-named 
function appearing in (fl6l) . 

We can now state the desired generalization: 

Theorem 5.2. Fix k and choose a real number E > 1. Then, for n ranging over C k E , 
we have 

\Q(k,n)\ = ni ' ak) nk +0(f k (n)). 

An easy computation that follows directly from the definitions shows that 

lim (/ fc (n)/m • . . . • n k ) = 0. 

n— »oo 

According to §4.2, Theorem 15.21 then implies that 
(20) |g(fe,n)l~ ni ' c{k) nk - 

The remainder of this section is devoted to a proof of Theorem 15. 2[ 
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5.1. The integral case. We begin by proving an analog of the theorem in which n 
ranges over the integral fc-tuples in C%. That is, n ranges over C% f~l Z fc . The proof 
follows that of Sittinger's proof of ( Fl6|) . with modifications to take into account the 
fact that not all the rij are equal. 

To simplify the notation in the computation, we adopt the following convention: 
whenever we have a fe-tuple of reals, say (zi, . . . , Zk), we shall write n(zi) to denote 
the product z\ • . . . • z\~. 

Using the inclusion-exclusion principle, we compute 

(21) 

\Q{k,n)\ = 7i(n i )-^2n([n i /p 1 })+ ^ irdn./p^})- ^(["i/piPaPa]) + • • • , 

Pi Pl<P2 Pl<P2<P3 

where [ ] denotes the greatest integer function and the pi range over the set of 
primes. Note that each of the terms [n^/pi ■ ■ -Pr] is zero when either r is sufficiently 
large or some p^ is sufficiently large. Therefore the expression on the right hand side 
reduces to a finite sum. As Sittinger does in his special case, we consolidate ( 12"T|) by 
using the Mobius function /i: 



(22) lQ(M)l = X>0>([ni/i])- 

i=i 

Clearly the summands for which j > mm{ni . . . , n^} all vanish. We now need a lemma 
to help evaluate the products n([ni/j]). 

Lemma 5.1. For any x = (xi, . . . , Xk) in C\, set 

0(x) = ir(xi)/ min{xi, . . . , x k }. 

Let y = y(x) be any function C\ — > C\ satisfying |xj — yi\ < 1 for all i = 1, . . . , k. 
Then 

vr(x 2 .) = tt( 2 / 1 ) + O(0(x)). 
We give a proof at the end of this section. 
Corollary 5.1. Let x and y be as in Lemma \5.1\ Then, 

n(x l ) = 7r(y t ) + 0( 1 (x) k ~ 1 ). 
Proof. Choose any y as in Lemma 15.11 The lemma implies that 

( 23 ) W^i) -n{y%)\ 



4>W 

->k 



is bounded for all x e C%. Now, rnax{x x , . . . ,Xk}/E > 7(x)/Z?. Therefore, by our 
eccentricity assumption, min{xi, . . . , x k } > max{xi, . . . ,x^}/E > 7(x)/£". It follows 
that 0(x) < i?7r(xj)/7(x) = E^(s) k ~ l . Combining this with (12"3| ) we conclude that 

|7r(gj) - Jc(yi)\ 
7(x) fc " 1 

is bounded, as desired. □ 
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We now return to our proof of Theorem 15.21 by applying Corollary 15.11 to the case 
in which 

r 7li 



Hi 

Xi = — and yi 
J 



J 



assuming that j < min{ni, . . . , n^}. Note that 

TK/i, • • • , n k /j) = 7(n x , . . . , n k )/j. 

Therefore, 15.11 yields 

(24) 7r([ ni /j}) = irim/j) + OftCn)*" 1 /^- 1 ). 

Feeding this into equation (1221) (and recalling that we may assume that j < 
minjrii, . . . , rik} and that the Mobius function fi assumes only the values and ±1 ), 
we get 

m m 

\Q(k,n)\ = Y,^j)<n t )/j k + Y,^j){<[n l /j])~<n l /j)) 

3=1 3=1 
m m 

= 5>(j>(^)// + o(£ Hmi^/j"- 1 ) 

3=1 3=1 

m m 

(25) = tt^^Mj)// + 0(7(n) fe - 1 ^l/j fe - 1 ), 

3=1 3=1 

where m = min{ni, . . . , ri^}. 

Next, Sittinger observes that, by the definition of the zeta function, 

m oo 
j=l j=m+l 

with the tail dominated by 

dt 1 



t k (k - l)m k - 1 ' 

(Recall that throughout this section, we are assuming that k > 2. 
Therefore, setting M = max{ni, ...,71*.}, 



(k — l)m k 

j=m+l v ' 

miT(ni)E k 



(k — l)M k 
E k m 



< 

~ k-1 
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Next, we compute 

m 

0(7(n)*- 1 £l/j*- 1 ) 



by observing that 



which is 



a i . , . , r dt 

^ _ 1S dominated by 1+ / 
j=i J 



0(ln( 7 (n)), k = 2, 
0(1), A;>2. 



Therefore, 
(27) 



C)Mn) k - 1 Vl/^ 1 ) is / Wn)M7(n)), = 2, 
^(7(n) 2^VJ ) 1 ^(Tfn)*- 1 ), fc > 2. 

.7 = 1 ^ 



Combining ( 1251) . ( )26|) . and ( 1271) . we obtained the desired result when n ranges over 

C% n z fc . 

Our next task is to derive from this the result for general n. 

5.2. The case of general real Axtuple n. Choose any n in C\ and set 

[n] = ([ni],...,K]). 

It is not hard to check that [n] e C\ E PI Z fe . 
By what was proved in the integral case, 

\Q(k, m )\ = ^ + 0(f k ( m )), 

for m ranging over C\ E D Z fc . 

It will be convenient to reformulate this as follows: There exists a constant H , 
depending only on E and k, such that 

(28) I |Q(fc,m)| - 7 ^7j7T-\ <H- f k (m), 

for m ranging over C\ E n Z fc . 

Clearly, [nj < n,, so that /^([n]) < /^(n). Further, it is immediate from the 
definition that Q(A;, [n]) = n). Therefore, replacing m by [n] in inequality (T2B1 
and using the foregoing observations, we get 

(29) \\Q(k,n)\-^^\<H.f k (n). 

Finally, by Corollary 15. 1[ there is a constant H', depending only on k, such that 

Wn i )-Tr([n i ])\<H'-7{n) k - 1 . 

Dividing this last inequality by C(fc), adding the result to (|29|) and using the defi- 
nition of //«(n), it follows that 
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\\Q(k,n)\-^\<H"-f h (n), 

where H" is a constant depending only on E and k. This translates to the desired 
statement involving big O. 

It remains to prove Lemma [5. 11 



5.3. Proof of Lemma 15.11 We begin with the identity 

k 

(30) ir(xi) - ir(yi) = - y h )yi ■ ... ■ y h -ix h +i ■ ■ ■ ■ ■ x k , 

h=l 

which can be proved by induction on A; or by a simple algebraic manipulation. Equa- 
tion (I3"U|) holds for all elements Xj, yj in any commutative ring. 

Next, since the hypothesis of the lemma states that \xi — yi\ < 1, for all i, f[3"Uj) 
implies that 

k 

(31) Info) - 7r(j/i)| < ^(1 + x x ) ■ . . . • (1 + • • . . . • x k . 

h=l 

We may write 

(1 + xx) ■ ■ ■ ■ ■ (1 + = 1 + cti _1 (xi, . . . , x h _i) + . . . + cr^(xi, . . . , x h -x), 

where cr^ _1 is the a th elementary symmetric function in h — 1 variables. Therefore, 

k h-l 

\lt(Xi) - Z(yi)\ < 22z2^a~ 1 ( x ^ ■ ■ -, x h-l) x h+l ■ ■■■■ x k . 
h=X a=0 

Now o"^ -1 is the sum of all products of a distinct unknowns selected from the h — 1 
unknowns. So 

°~a 1 ( x l) • • • ) x h-l) x h+l ' • • • " x k 

consists of C 1 " 1 ) terms, each of the form 

(32) Xi x ■ . . . ■ Xi a ■ x/i+i • . . . ■ x k . 

Let {jx, ■ ■ ■ ,jb} denote the complement of {ix, . . . , z a } in {1, 2, . . . , h—1}, where a+b = 
h — 1. Then, we may rewrite expression ( 132]) as 
^22^ xx ■ ■ ■ ■ ■ Xh-x ■ Xh+x • • • • ■ x k 

x h ' ■ ■ ■ ' x j b 

Since each X{ > 1, the expression in (1331) is < 0(x). Therefore, 

k h-l 



W(xi)-n(Vi)\ < EE^/V^ 

h=X a=0 ^ ' 

= ^ (x) (^ 2 "") 



(2' - l)0(x). 
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This immediately implies Lemma \5. II and, with it, completes the proof of Theorem 

6. Bounds on the numerators and denominators of the density 

estimate 

6.1. Estimating the cardinality of Bk(R)- 

Recall that Bk(R) consists of all relatively prime integer (/c+l)-tuples (ax, . . . , a*-, b) 
such that each |aj| < R and < b < R. In this section, we obtain the following 
estimate: 

ok r?fc+i 

(34) \ Bk (R)\ = ^-^ + 0(J 1h . 1 (R)). 
Since 

fk+i(R) n 

fi-^oo R k + l 

it follows that (cf. the last paragraph in §4.2p 

(35) 1^(72)1 - " 



C(* + i)' 

Our derivation of the estimate is based on Theorem 1 5. 2 1 However, that theorem 
refers only to tuples whose entries are positive integers. So, we must see how to include 
zero and negative entries into our count. To do this we first introduce some extra 
notation. 

Let I and J be disjoint subsets of {1, . . . , k}. 
Define 

di < i E I 

(36) B K (R;I,J;k) = { ( di, . . . , Ofc, b) G Bk{R) '■ on — i G J 

a* > <^ otherwise. 



In the notation of §H1 

B K (R; 0, 0; k) = Q(k + 1, R), 

where R is the (k + l)-tuple (R, . . . , R). 
Therefore, according to Theorem I5.2[ 

(37) \B K {R- 0, 0; k)\ = + 0(f k+1 (R)), 

where 
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For each subset /, the set Bk(R; I, 0; k) consists entirely of relatively prime (k + l)-tuples 
(a 1; ...,%,&) for which all the entries are non-zero. Since changing the sign of one or 
more of the Oj's does not affect their absolute values or divisibility properties, such 
sign changes can be used to define a bijection between any two of the Bx(R; I, 0; fc)'s. 
Of course, they are all pairwise disjoint. So, using equation (|37j) . we obtain 

(39) | \jB K (R-I,(/)-k)\ = ^ j -—^ + 0(f k+1 (R), 

where |J denotes the disjoint sum. 

The remaining (& + l)-tuples in Bx(R) consist of those for which some a,'s are zero. 
These all sit in "lower dimensional" cubes, and so their contribution gets absorbed 
by the big O notation. We make this precise as follows. 

Suppose first that k > 1, < m < k, and J is a subset of {1, ... , k} of cardinality 
m. Then delete the elements of J from {1, . . . , k}, and renumber the remaining num- 
bers, in order, using {1, . . . , k — m}. Given any / disjoint from J as before, renumber 
it using the renumbering just obtained. This produces a subset I' of 
{1, . . . , k—m}. These operations on indices determine a bijection between Bk{R] I, J] k) 
and Bk{R] I', 0; k — m). Therefore, for any fixed, non-empty J of cardinality m, equa- 
tion fl39l) implies that 

(40) | \_\B K (R;I,J;k)\ = — — - + 0(f k - m+1 (R). 

If J* is any other non-empty subset of {1,...,A;}, then |_l 7 Bk(R; I, J] k) and 
I_| 7 * B K (R; I*, J*; k) are disjoint. They have the same cardinality when \J\ = |J*|. 
Therefore, letting J range over all non-empty proper subsets of {1, . . . , k}, we have 

1 / / h. \ ofc— m r)k— m+1 \ 

(41) iuu w ,^)i= E (u) c(t .- m+1) +0 '^' R ))' 

J^0 / \J\=m=l V V 7 SV 1 7 

We leave to the reader the check that the expression on the right is 0(fk+i(R))- 
Now consider the case m = k. Then J — {1, ... ,k}, and the only possible set / is 

the empty set. In this case the left-hand side of equation (T40l reduces to \Bk(R; 0, J)\. 

But the cube Bk(R; 0, J) is just the singleton set consisting of (0, . . . , 0, 1), and so, 

allowing the case J = {1, . . . ,k} in the expression on the left-hand side of ( l4Tj) . we 

still get that its cardinality is 0(fk+i(R)) 

A similar special argument applies to the case k — 1, which we leave to the reader. 

We can now conclude: Since Bk(R) is precisely the disjoint union of | | 7 Bk(R] I, 0; k) 

and |_lj^0 U/ Bk{R, </; k), estimate ( 1341) follows immediately. 
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6.2. Defining Q(R) and estimating its cardinality. We recall from Section H] 
that the definition of density as well as all the related concepts and computations 
began with a choice of basis V = {v\, . . . ,Vk} of the field K over Q. We made no 
assumptions about V. It will now be convenient, for notational and computational 
simplicity, to make the assumption that each real number Vi is > 1 (cf. Appendix B). 
Set ||V|| = ir(vi) (= v i • . . . • v k ). 
Define m and n in IR fc+1 as follows: 

k p 

m k+ i — K, 
k+1 

Kk+l = R- 

Then the set Q(R) is defined to be the set difference 

(42) Q{R) = Q(k + l,n)\Q(k + l,m), 

where we use the "cubes" defined in Section [5j Since Q(k + 1, m) C Q(k + 1, n), we 
have 

(43) \Q(R)\ = |Q(A; + l,n)|-|Q(fc + l,m)|. 
According to Theorem 15.21 and the definition of Q(k + 1, n), 

(44) l^ + 1 -""- ( t . + lW |y|| C ( t . + 1 )) +0 <^'")'- 

Applying Theorem 15.21 to |Q(A;,m)|, and using the fact that /fe + i(m) < fk + i(n), 
we get, similarly, that 

k2 k R k+1 

(45) |Q( t + l,m)| = (t + 1)t+1||v||c(fc+1) +0 (A +1 )(n), 

where we think of n as a function of m. 
Therefore, combining (|44p and 



lQ{R)l = (k + m\V\\C(k +l)~(k + l) fe+1 ||V||C(A; + 1) + ° (/fe+l(n)) 

+ 0(A+i(n)). 



(A; + l)*+i||V||C(A; + l) 

An easy computation shows that the geometric mean 7(11) is given by 

1 

/ 2 k \ fe+1 

l{n) = R \ (k+inv\\ ) ' 

so that, setting D equal to the coefficient of R in this expression, we get 



/fc+i(n) 



DRln(DR), k = l 
D k R k , k > 1. 
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Therefore, we obtain 

(«) iqwi = (t+1)W||v|l((t+1) + <K*»m 

where, here, Fk+i(R) is obtained from ff. + i(p) above by deleting all reference to the 
constant factor D. 

Again, as before, we obtain from the above big O relation the corresponding as- 
ymptotic relation 

(47) \Q{R)\ - " 



(k + l) fc+1 ||V||C(& + 1) 



6.3. Using \Q(R)\ as a lower bound for \B K (R) n [ — 2, 2] | . 

Lemma 6.1. Q(R) C B K (R) n [-2,2]. Hence, \Q(R)\ < \B K (R) n [—2, 2] | 

Proof. Referring to the defining equation for Q(R) (equation f|42l ). we note that 
since Q(k + l,n) is a subset of B K (R), by construction, we need only check that 
(a 1; ...,%,&) in Q(R) satisfies 

_ 2 < aiVi + • • • + dkVk < 2 
— 6 — 

Moreover, since all the terms in the middle expression are positive, it remains only 
to verify the right-hand inequality. 

Choose any (ai, . . . , a k , b) in Q(R). Then, by construction, 

2R 

0<a l < WT ^, . = 1,...,* 



and 



Therefore, 



k R <b < R. 



k + l 



2k D 



as desired. □ 

We may now use Lemma I67H together with the asymptotic estimate (j!7|) . to get a 
lower bound for \Bk(R) H [— 2, 2]|. In particular, choose any e G (0, 1). Then, 

|fl*(i2)n[-2,2]| > " " 



(fc + l)*+i||V||C(fc + l)' 
for R sufficiently large. This is clearly a vast underestimate in general, but it will 
do for our purposes. 
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7. Proof of Theorem 14.11 

Recall that Theorem 14.11 asserts that 

5 K (R) is £>(iT§ (fc+1) ) 

when K is a real field of degree k < 2. As shown in Section HJ in the presence of the 
estimates in the preceding section, this follows from the existence of a function 

S : (0,oo) ->■ (0,oo), 

such that 

(a) S is and 

(b) f(K)nB K (R)Cf(B K (S)). 

Recall that / here is the polynomial function given by f(x) = x 3 — 3x. 

In this section we construct such a function S. In general, S will depend on K, 
although the basic form and idea of the construction will be the same for each K. 

Note that a typical element in the left-hand set in b) above is of the form f(a) 
such that the height hxifioi)) is < R. In order to gain usable information from this 
fact, we must be able to compute this height or some bound on the height in terms of 
the data supplied by a. The problem we initially face is that, for any (3 G K, hx{(3) 
is defined in terms of a canonical representation of /3 in terms of the selected basis V 
of K (cf. (2)). The k + 1 integers appearing in this representation are assumed to 
be relatively prime. However, although this is what we may assume for the integers 
appearing in the representation of a, when we apply / to this representation and 
expand to get the result into the appropriate form, the integer coefficients we get 
need not be relatively prime. Our first task, therefore, is to obtain a bound on the 
greatest common divisor of these coefficients. 



7.1. Bounding the greatest common divisor. When K — Q, there is no problem. 
For if we choose a/6 G Q, where a and h are relatively prime integers and b > 0, then 
f(a/b) = (a 3 — 3b 2 a)/b 3 , and it is easy to check that numerator and denominator are 
relatively prime. So we now turn to real fields of degree 2. 



Real quadratic fields are known to be of the form Q(yd), where d is any positive, 
square-free integer. In this case, we choose the basis V to be the set {1, v^}- V 
consists of integral elements of K, but we do not use this fact. Every a in K may be 
written uniquely as 

, AO , a 1 + a 2 y/d 
(48) a = , 

where ai,a,2,b are relatively prime integers and b > (cf. ([2])). Now apply / to (j4"8"j) 
to obtain 



a 



(49) f(a) = a 3 -3a 



I + 3daia\ — 3ai6 2 ) + (3afa 2 + da 3 , — 3a 2 b 2 )Vd 
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In this subsection, we write the long expression as 

A 1 + A 2 Vd 
B 

to simplify notation. Often, we shall use the triple (A 1: A 2 , B) instead of this fraction. 
Let G denote the greatest common divisor (g.c.d.) of A 1: A 2 ,B. 
Using this notation, we can express the height /^(/(a)) as follows: 

max(|Ai|, \A 2 \,B) 



(50) h K (f(a)) 



G 



Lemma 7.1. G\8d 

Proof. Let p be a prime dividing G, and suppose that p\a\. Since p\B, we know that 
p\b, and so we cannot also have p\a 2 . Therefore, using p\A 2 , we have p\3a\ + da\ — 3b 2 . 
This implies p\d. 

Next, suppose that p 2 \G and also p\a±. Then, since p 2 divides 3a\a 2 — 3a 2 b 2 as well 
as A 2 , we have p 2 \da\. We still cannot have p\a 2 from the above argument, so p 2 \d, 
which contradicts the fact that d is squarefree. 

Therefore, any common prime factor p of G and a± must be a factor of d and occurs 
only to the first power in G. 

Now suppose that a prime p divides G but p does not divide a±. In this case, p 
divides Ai/ai = a\ + 3da\ — 3b 2 and also b, so p cannot divide a 2 . This implies that 
p divides A 2 /a 2 = 3a\ + da\ — 3b 2 . Hence p divides both 

al + 3da\ 

and 

3a\ + dal, 

which implies that p\Sa"l, hence p|8. Therefore, in this case p = 2. 

Still sticking to the case p\G and p |/ai (so p = 2), suppose that 2 4 |G. Since a± 
and a 2 are both odd in this case, and odd numbers represent invertible elements in 
the ring of integers mod 16, we may divide A\ by a\ and A 2 by a 2 in that ring to 
obtain congruences 

a\ + 3da\ -3b 2 = mod 16 

3a\ + dal - 3&2 = mod 16 - 
By our assumption on G, we have 16|-B = b 3 , which implies that 4\b, hence b 2 = 
mod 16. Therefore, the above equations become 

a\ + 3da\ = mod 16 
3a\ + da\ = mod 16. 

Subtracting the first of these from three times the second, we get 

8aj = mod 16, 

a contradiction since a x is odd. 

Therefore, the highest power of 2 dividing G is < 2 3 . 

The result is now immediate. □ 
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Applying the lemma to equation ( l50i) . we get 
Corollary 7.1. 

, /,/ ^ . maxd^xl, \A 2 \,B) 
M/(«)) > . 



7.2. A certain cubic curve. The estimates that we want to make to conclude the 
proof of Theorem 14.11 all involve features of a certain cubic function: 

$d,e(x) = D(x 3 -3E 2 x), 

where D and E are positive real parameters. 

Lemma 7.2. Choose any real T > and suppose that E < T 1 / 3 . If x > T 1 / 3 + E, 

then ®d,e( x ) > DT ■ 

Therefore, making use of the contrapositive, &d,e(x) < DT x < 2T 1 / 3 . 

The proof is an exercise in elementary calculus and so will be omitted. 
Note that since <&d,e is an odd function of x, Lemma \7. 21 implies that for T and E 
as in the lemma, 

®d,e{x) > -DT => x > -2T 1/3 . 

7.3. Constructing S. We continue with the notation of Section I7TT1 

7.3.1. The case K=Q. . Recall that, for a/b e Q, a, b relatively prime and b > 0, we 
have 

Mb) = = ^M/b 3 - 

Therefore, applying Lemma l7T2T c). with T = R, D = 1, and E = S, we may conclude 
that if $! 5(a) < R and b < R 1 ^ 3 , then a < 2R 1 ^ 3 . Similarly, by the remark following 
the lemma, if > —R and b < i? 1//3 , then a > — 2i? 1//3 . Using the fact that the 

numerator and denominator in the above expression for f(a/b) are relatively prime, 
we may interpret the foregoing as saying that 

f(a/b) G B K (R) a/b G B K (2R 1 ' 3 ). 

Now apply / to the right hand side of this implication to conclude that 

f(K)nB K (R)Cf(B K (2R 1 ^ 3 )). 

This argument shows that we may define the desired function S by 

S{R) = 2R 1 ' 3 , 

thus concluding the proof of Theorem 14.11 in the case K = Q. 
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7.3.2. The case of real quadratic fields. We refer to Section 17.11 and particularly to 
expression (|49p to point to the notation that we shall be using here. 

(a) We assume throughout this part that B < 8dR so that b = B 1 / 3 < (8dR) l l z , 
where R is an arbitrary positive real as before. (Here 8dR will correspond to the real 
number T appearing in Lemma [7.21 and b will correspond to the parameter E.) 

Suppose now that a\ > (8dR) l l 3 + b. In particular, a\ is positive, so we have 
inequality 

A\ = a\ + 3da±al — 3a\b 2 > af — 3b 2 a± = $1,6(01). 
Using Lemma [7. 2\ we get the further inequality 

®i,b( a i) > 8c ^- 

Therefore, (always assuming B < 8dR), we get the implication 

ai > (8dR) 1/3 + b A x > 8dR. 

Using the contrapositive version of this, we conclude that we have the implication 

(51) A l = a\ + 3d ai a 2 2 - 3ai& 2 < 8dR 01 < (8dR) 1/3 + b< 2(8dR) 1/3 . 

Similarly, using the fact that c&^&pT) is an odd function of x, as mentioned after 
Lemma [7. 2\ we may also conclude that when b < (8c/i?) 1//3 , we have the implication 

(52) A 1 = af + 3da x a\ - 3a x b 2 > -8dR =► ai > -2(8dR) 1/3 . 
Consolidating these, we get 

(b) We now apply a similar argument to 

A2 = 3a\ + da\ — 3a2& 2 . 

We assume throughout this part that B < 8R. (Here 8R will correspond to the real 
number T appearing in Lemma 17.21 and c — b/ yd will correspond to the parameter 
E.) 

Suppose that a 2 > (8R)^ 3 + c. Of course a 2 is positive, so that we get the inequality 
A 2 = 3a 2 + da\ — 3a 2 b 2 > da 3 , — 3a2& 2 = d(a\ — 3c 2 a 2 ) = $^ c (a2), 
We can now apply Lemma 17.21 again to get the further inequality 

$d, c (a 2 ) > 8dR. 
Thus, as above, we get an implication 

a 2 > (8R) 1/3 + c => A 2 > 8dR. 
A similar argument yields, in the presence of the assumption B < 8dR, 

a 2 < -(8R) 1/3 - c A 2 < -8dR. 
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Combining these two implications and passing to contrapositives, we get 
\A 2 \ < 8di?} l« 2 | < 2(8i?) 1/3 < 2(8^) 1 /3. 

(c) We now wrap things up and finish the proof of Theorem 14.11 

Suppose that f(a) G Bjc(R). That is, /^(/(a)) < R. By equation ( )50l) and 
Corollary 17.11 we get 

max(|Ai|, \A 2 \,B) max(\A l \,\A 2 \, B) 

8d " G = M/(a)) " *> 

so that 

max(|Ai|, \A 2 \,B) < 8dR. 

The conclusions in (a) and (b) above imply that hx(a) = max(|ai|, \a 2 \, b) < 2(8<iR) 1 / 3 . 
Therefore, a G B K (2(8dR) 1 ^ 3 ), implying that f(a) G f(B K (2(8dR) 1 / 3 )), i.e., f(K) n 
B K {R) C /( J B^(2(8di?) 1 /3)). 

The desired conclusion now follows by defining the function 5* = S(R) by 

S{R) = 2{8dR) 1/3 . 

This completes the proof of Theorem 14.11 



8. Some further comments 

Several further directions are possible for the inquiry begun by this paper. 

For example, one could attempt to prove Conjecture 1 for other real number fields 
K or for all of them. And one could attempt to prove the sharper Conjecture 2 
for fields of low degree. There is also the global question, presumably substantially 
harder, of obtaining the density of Tri in A fl K. 

Another kind of problem would be to improve the estimates given in this paper, 
even for fields K of small degree. If one follows the arguments given here, it seems that 
what is required is further information on the "geometric" distribution of relatively 
prime fc-tuples of integers. For example, I do not know whether the distribution is 
uniform throughout lR fc . Perhaps analytic number theorists have looked at this, but 
I do not know of any such results. 

Along another line, we mentioned briefly that the set of angles that are both con- 
structible and trisectable forms a countable subgroup of the circle group. It would be 
interesting to obtain further information about this group. 

Finally, one might attempt to solve similar estimation problems for p-sectability, 
either for various specific primes p or for primes p in general. (See Appendix A below 
for some preliminary results in this direction.) Since the key trigonometric equations 
are much more complicated for p > 3, other techniques are probably required. 
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Appendix A. n-sectability of angles 



Let n be any positive integer. The reader may naturally wonder what n-fold sub- 
divisions are achievable for all angles via Euclidean ruler and compass construction. 
Of course, when n has the form 2 k , such subdivisions are always possible via iterated 
bisection. The case n = 3 was settled by P-L. Wantzel, as we have discussed in the 
main body of this paper. The case of general n is settled by the following theorem: 

Theorem A.l. Suppose n is a positive integer such that, for any angle a, there exists 
a Euclidean construction that starts with a and produces a/n, i.e., suppose that every 
a is n-sectable. Then, n has the form 2 k , for some non-negative integer k. 

Proof. Case 1: Assume that n is an odd prime. We write n = p. 

The standard identity (exp(ip9) = exp(i9) p yields the following trigonometric for- 
mula: 



where q = |(p — 1). Set a = cos(p9) and x = cos(6 l ). Then, we have an equation 



We regard P(x, a) as a polynomial in x with parameter the constant term a, anal- 
ogous to the polynomial p(x, a) defined in §1.2. Indeed, the explicit relationship 
between P(x,a) and p(x, a) when n = 3 is given by 2P(x,a) = p(2x,2a). We now 
obtain information about the coefficients of P(x,a) in equation ( 1531) . 

a) The top-degree monomial in P(x, a) is 2 p ~ 1 x p . 

To see this, we consider the summands on the right hand side of equation ( 153"j) , and, 
holding &;-fixed, we see that the maximum exponent obtainable occurs when £ = k, 
yielding x p . This is independent of k, so x p is the maximum power of x occurring 
in the formula. This would imply that the degree of P(x, a) is p, provided that the 
coefficients in the formula satisfying I = k do not sum to zero. 



The terms with £ = k have coefficient sum YH=o(~^-) 2k iik) = Sfc=o (2D' wm ch 



1 v^P ( P\ _ op-l 

2 Z^m=0 Km) 

This verifies that P(x, a) has degree p with leading coefficient 2 P_1 . 

b) Except for the leading coefficient and the parameter a, every coefficient in P(x, a) 
is divisible by p. Indeed the coefficient of the first-degree term is (— l) q p. 

The coefficient (— l) fe+f \ (\ is clearly divisible by the prime p as long as k 7^ 0. 
This implies the first statement. For the second statement, set p — 2k + 2£ = 1. Then 




(53) 





32 



PETER J. KAHN 



it follows that < £ = k — \{p — 1) = k — q < 0. Therefore £ = and k = q, which 
implies that the coefficient of x in P(x, a) is as stated. 

We now can apply the Eisenstein criterion [WaeJ to the polynomial P(x, a) for 
appropriate choice of the parameter a. In particular, choose any integer c that is 
divisible by p but not by p 2 , and let d be any positive integer prime to c such that that 
— 1 < c/d < 1. Then d p P(x,c/d) is a polynomial in Z[x] whose top coefficient is not 
divisible by p, whose remaining coefficients are divisible by p, but whose constant term 
is not divisible by p 2 . These are precisely the conditions under which the Eisenstein 
criterion implies that d p P(x,c/d) is irreducible in Z[se], except possibly for constant 
factors. It follows immediately from Gauss's Lemma that P(x,c/d) is irreducible in 

Q[x). 

The argument now is almost identical to the case n = 3 argued before. Choose 
a such that cos(a) = c/d. Then cos(a/p) is a zero of P(x,c/d), by equation (j531 
and the preceding trigonometric formula. Suppose cos(a/p) were constructible over 
{(0, 0), (1, 0), a}, and let / be its minimal polynomial. Then the degree of / over 
Q(cos(a)) = Q(c/d) = Q is divisible by a power of 2 and / is a factor of P(x, c/d) in 
Q[x], contradicting the irreducibility of P(x, c/d). Therefore, a/p is not constructible, 
concluding Case 1. 

Case 2: General n. We begin with a simple general observation. 

Lemma A.l. Suppose that k is a factor ofn. If a is n-sectable, then it is k-sectable. 

Proof. Write n = kl. Since a/n is constructible over {(1, 0), (0, 1), a}, so is the 
multiple a/k = £a/n. □ 

Now suppose that n is not a power of 2. Then it has an odd prime factor p. Let a 
be an angle that is not p-sectable, which exists by Case 1. Then, by Lemma [A. 14 a 
is not n-sectable. This proves the theorem. □ 



Remarks: (a) The above argument can be slightly elaborated to imply that if r is 
a rational number strictly between and 1, and if, for any given angle a, there is a 
Euclidean construction that starts with a and produces the angle ra, then r must 
have the form k/2 , for some integers k and £, where £ > 0. Of course, when r does 
have that form and any a is given, the angle ra can be constructed. 

(b) Let a be any angle, and set a = cos(a) as before. Choose any positive integer n. 
Then, as we have seen above cos(a/n) is algebraic over Q(a). In particular, cos(a/n) 
is an algebraic number whenever a is. It follows that the non-rz-sectable angles a 
produced in the proof of Theorem A.l have algebraic cosines. 

Proposition A.l. If n is a positive integer such that 2ir/n can be constructed (i.e., 
the regular polygon ofn sides can be constructed), then there exists a countable dense 
subset of S 1 consisting of n-sectable angles. 
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Proof. This follows from the construction of Yates described earlier in §3.1. Namely, 
let m be any positive integer prime to n, and choose integers a and b such that 
an + bm = 1. Multiply this equation by the quantity 2tt /ran Then, (l/n)(2ir/m) = 
a(27r/m) + b(2n/n), showing that 2ir/m is n-sectable. The set of such angles 2n/m 
is clearly countable and dense in S 1 . □ 

As is well known, Gauss showed that a necessary and sufficient condition for 27r/n 
to be constructible is that <f>(n) be a power of 2, where <p is the Euler function. 
Examples of odd n for which this is true are: n — 3, 5, 17, 257, 65537. 

Proposition A. 2. If n is a positive odd integer such that 2ir/n can be constructed , 
then there exists a countable dense subset of S 1 consisting of non-n-sectable angles. 

Proof. To see this, suppose that n is an odd number such that 27r/n is constructible, 
and suppose that (3 is a non-n-sectable angle. By the argument for Proposition 
A.l, every integer multiple of ir/2 k is n-sectable, for any k > 0. We claim that 
7 = (3 + cir/2 k is not n-sectable, for every integer c and every k > 0. The argument is 
essentially that given in §1.2. We give it here for the reader's convenience: Suppose 7 
were n-sectable. Then, starting with (3 we could construct 7 and then 7/n. Construct 
C7i/n2 k and subtract this from 7/n, obtaining 0/n. This would provide a Euclidean 
n — section of /3, which is impossible. The set of all these non-n-sectable 7's is clearly 
countable and dense in S 1 . □ 

If cos(/3) above is an algebraic number — and such exist for every n that is not 
a power of 2, by Remark (b) above — then it is easy to see that cos(7) is algebraic. 
Therefore, the foregoing gives a countable dense set of non-n-sectable angles with 
algebraic cosines. The case of transcendental cosines is handled by the next result, 
which extends Corollaries 13.11 and 13. 2[ 



Theorem A. 2. Suppose that cos(a) is transcendental and that n is a positive integer 
that is not a power of 2. Then a is not n-sectable. Therefore, the set of all non-n- 
sectable angles is uncountable, and the set of all n-sectable angles is countable. 

Proof. We use the notation introduced above. In particular, we use the polynomial 
P(x,a) in (153]) . where a = cos(a). Let P(x,t) e Q[t] be the polynomial obtained 
from P(x, a) by replacing a by the indeterminate t. 

Assume first that n is an odd prime p. We claim that P(x,t) is irreducible in 
Q[t]. To see this, choose c/d as in Case 1 of the proof of Theorem A.l. Let \ be 
the Q-algebra homomorphism that sends t to c/d, so that x(P(x,t)) = P(x,c/d). 
Since P(x,c/d) is irreducible, by Case 1 of the proof of Theorem A.l, P(x,t) must 
be irreducible, as claimed. 

It follows immediately that P(x, a) is irreducible, because Q[t] = Q[a] when a is 
transcendental. 

We now argue as we did earlier. Since n is not a power of 2, it has an odd prime 
factor p. If cos(a/p) is constructible over Q(a), then its minimal polynomial over 
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Q(a), say /, has degree a power of 2. In particular, the degree does not equal or p. 
But cos(a/p) is a zero of P(x, a), so / divides P(x, a), contradicting the irreducibility 
of P(x, a). 

Therefore, a is not p-sectable. Applying Lemma \A.1\ we conclude that a is not 
n-sectable. 

The last two statements of the theorem simply use the standard facts about tran- 
scendental numbers and algebraic numbers. □ 



Appendix B. Change of basis 

Let Vi = {v i, . . . , Vk} and V 2 = {wi, . . . , w^} be Q-vector space bases of the real 
number field K, and let hi and h 2 be the corresponding height functions, as defined 
in §7.1. We show first that hi and h 2 are commensurate. More precisely, we show 
that there is a positive integer d such that 

(54) -h 2 <hi< dh 2 . 

d 

Let T = [tij] be the k x k matrix of rational numbers given by 

k 

w j = ^2t ij v i , j = l,...,k. 

i=i 

We say that T is elementary if one of the following is true: (a) T represents a 
permutation of the basis V\. (b) T represents the addition (resp., subtraction) of one 
basis vector of V\ to (resp., from) another, (c) T represents the multiplication of one 
basis vector of Vi by a non-zero rational number while fixing the others. 

Of course, every product of elementary matrices is invertible. Elementary row- 
reduction would imply the converse, except row reduction allows a slightly richer 
class of elementary matrices of type (b) . However, it is easy to see that these can be 
obtained by multiplying suitable elementary matrices of the above type. So, every 
invertible matrix is a product of ones that are elementary in the above sense. 

Lemma B.l. IfT is an elementary matrix, then (54\ ) holds. 

Corollary B.l. Let hi and h 2 be height functions corresponding to two choices of 
rational bases of K . Then, there exists a positive integer d satisfying the inequalities 

•9- 

Proof. Let T be the change of basis matrix, factor it into a product of elementary 
matrices, and apply the lemma successively to these. □ 

It remains to prove the lemma. 

Proof, (a) When T is a permutation matrix, the representations of a field element 
a in terms of the two bases differ only by a permutation of the coefficients. But 
the definition of the height function shows that it is invariant under permutation of 
coefficients, so that hi = h 2 in this case: i.e., the inequalities are satisfied for d = 1. 
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(b) Without loss of generality, let us assume that the basis change involves adding 
(resp., subtracting) v 2 to (resp. from) V\ and fixing the other vectors. Then, writing 

(55) a = (a 1 v 1 + . . . + a k v k )/b, 
as in §7.1, we have 

a = (aiWi + (a 2 =F ai)w 2 + a 3^3 + • • • + a k w k )/b. 

It is easy to see that a>i, . . . ,a k ,b are relatively prime if and only if ai, (a 2 ^fai), a^, . . . ,a k ,b 
are relatively prime. Therefore, assuming the former,we get 

h 2 (a) = max{|ai|, \a 2 =F a\\, \as\, ■ ■ ■ , |afe|, b} < 2 max{|ai|, . . . ,\a k \,b} = 2h\{ot). 

A symmetric argument proves the same inequality with hi and h 2 exchanged. There- 
fore (jS"%l) holds in this case with d = 2. 

(c) Here we do not lose generality by assuming that £jj = 8y (the Kronecker delta) 
when ^ (1, 1), and tu = t, where t is a rational number, which can be written 
"in lowest terms" as t = r/s. Then, with a as above in ( 1551) . we may write 

a = (ai/t)w 1 + a 2 w 2 + ... a k w k )/b = ((sa^Wx + (ra 2 )w 2 + . . . + [ra k )w k )/rb. 

Notice that in this last representation of a, the integers sai,ra 2 , . . . ,ra k ,rb may 
not be relatively prime. So, we cannot apply the usual formula for the height func- 
tion to these. However, if we divide all these integers by their greatest common 
divisor, G, then the resulting integers are relatively prime, and they do result from a 
representation of a. It follows that 

(56) h 2 (a) = maxjsax, ra 2 , . . . , ra k , rb}/G. 

To proceed further, we need some sort of upper bound for G. This is provided by 
the following claim: G divides rs. To see this, suppose that p is a prime dividing 
G and that p m is the highest power of p dividing G. If p m fails to divide r and p m 
fails to divide s, then p must divide a\, . . . , a k and b, a contradiction. Therefore, p m 
divides r, or it divides s. Hence it divides rs. It follows that G\rs. 

We now apply this to equation (|56l) . 

h 2 (a) > max{sai, ra 2 , ■ ■ ■ , rak, rb}/\rs\ 

> min{|r|, |s|} max{|ai|, . . . , \a k \, b}/\rs\ 
= /ii(a)/max{|r|, \s\}. 

Thus, the right-hand inequality in (|54p holds in this last case as well, with d = 
max{|r|, \s\}. A symmetric argument produces the left-hand inequality. □ 

Next we use Corollary IB. II to show that our conjectures and results concerning 
density do not depend on the choice of the basis. 

If h\ and h 2 are height functions satisfying (15^1) . and if 

(57) J B i ( J R) = / i " 1 [0,/2), 
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then Corollary IB.ll implies that 

(58) B^R/d) C B 2 (R) C Bi(dfl). 

We recall the definition of density, with respect to each height function: 

(59) m) = \[-2,2]nB l{ RW 

for % = 1,2, (cf. §7.1). We shall show that S^R) is O(R-I^) <=> 5 2 {R) is 

e>(ir§( fe+1 )). 

Of course, by symmetry we need only prove one implication, say =K 
Using (EH} for i = 2, together with (J58"j) . we have 



S 2 (R) < 



\Tri C\ B!(dR)\ 



-2,2}nB 2 (R)\- 
Using fl58l) again, we get 



-2,2] HB^R/d)] K ' |[-2, 2] n Si (i2/d) | ' 

We now use the "box" Qi(R/d) constructed exactly as Q(R) is constructed in 
§6.2, equation (41). The inclusion Qx(R/d) C [—2,2] HB^R/d) follows exactly as in 
Lemma 6.1, so we get 

lgi(gg)| 

\Qi{R/d)[ 

Just as in §6, one shows that both \Bi(dR)\ and \Q\{R/d)\ are 0(R k+l ). It follows 
that their quotient is bounded, say by M. 

Thus, we have shown that 5 2 {R) < M5\{dR). From this, the desired implication is 
immediate. 



S 2 (R) < Si(dR) 



Appendix C. Constructible non-trisection numbers of arbitrarily 

HIGH DEGREE 

Since the standard examples of non-trisectable angles — namely, tt/3 + tt/2 u — are 
constructible, it may be of some interest to obtain information about how complicated 
they are to construct, by which we mean the minimal number of Euclidean ruler and 
compass steps it would take to construct them. This problem is certainly not well- 
posed, but even without going into a lengthy analysis, we can probably agree that the 
log 2 of the algebraic degree of cos(7r/3 + 7r/2 n ) (or, 2cos(7r/3 + vr/2™)) gives a weak 
lower bound. It ignores all the "rational" constructions required, but it does count 
the "quadratic" ones. 

In this appendix we prove the following: 

Proposition C.l. The degree of 2 cos(7r/3 + vr/2 n ) is 2 n . 

The proof is an extended exercise that uses standard trigonometric identities and 
well-known facts about field extensions. 
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C.l. Basic identities and computations. Define the numbers a n ,b n ,c n ,d n as fol- 
lows, for all n > 0: 



(60) a n = 2cos(7r/2 n ) 

(61) b n = 2sin(7r/2 n ) 

(62) c n = 2cos((7r/3) + (vr/2 n )) 

(63) d n = 2cos((tt/3) - (vr/2 n )) 

It is easy to prove, say inductively, that these numbers are all algebraic. In fact, 
since the angles in question are all obviously constructible, so are their sines and 
cosines (and also the doubles of these). Therefore, their degrees must be powers of 2. 
Our task is to show that these powers are not lower than expected. 

Next, we display certain standard trigonometric identities in terms of the numbers 
a n , b n , c n , d n . These will be used in our arguments. We also give a table of values of 
these numbers for n < 2. 



(64) 
(65) 
(66) 

(67) 

(68) 

(69) 
(70) 
(71) 



a n-l 
K-1 



d n -i 

a n-l 
Cn-1 



,2 _ 

2-bl. 



<-2. 



i _ Vs 

2 an 2 
1 

2«n + ^-On- 



= 2 



= 2-dl 



n 


a n 


b n 


Cn 


d n 





-2 





-1 


-1 


1 





2 


-Vs 


Vs 


2 


y/2 


V2 


V2 


1+^3 
V2 



C.2. The degrees of a n and b n . In addition to Q, it will be convenient to work 
with the field K = Q(y / 3). This has class number 1. That is, its ring of integers Ok 
is a UFD. So, every irreducible in Ok is a prime. Eisenstein's Theorem applies to 
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We define polynomials p n (x) £ Z[i], n > 1: 

= x 2 — 2 
Pn(a?) = p 1 (p n _i(ar)). 



Lemma C.l. For all 1 < k < n 



Pk(a n ) 



a. 



'n—k 



Pkipn) 




n—li 



k = ] 

k > 2. 



1 



We omit the easy induction proof. 
Lemma C.2. p n (x) is irreducible over Q and over K. 

Proof. We show first that p n (x) has the form x 2 " + 2xq(x) ± 2, for some q(x) G Z[se]. 
When n = 1 this is immediate from the definition. Assume the result for n — 1 > 1 
and compute 



Pn(z) = Pn-i(x) 2 - 2 = x 2 " + 2x(2q(x)x 2n 1 ± 2x 2 " 1-1 + 2xq(x) 2 ± 4g(z)) + 2, 



which has the desired form. 

Now consider first the case of Q. We use the prime 2 e Z, and we apply Eisenstein's 
criterion to p n {%), which clearly satisfies it. Thus p n (x) is irreducible over Q. 

For the case K = Q(v / 3), we use the prime 1 + v3 G Ox- (To see that 1 + \/3 is 
irreducible in Or, compute the norm N(l + V3) = (1 + V3)(1-V3) = -2, which 
is prime in Z. Since (9^ is a UFD, 1 + \/3 is prime.) Again, Eisenstein's criterion is 
seen to be satisfied. So p n (x) is irreducible over if. □ 

By Lemma [C-H p n -i(a n ) = p n -i(b n ) — a% = 0, for n > 2. Therefore, using a direct 
calculation to take care of the case n — 1, we have: 

Corollary C.l. deg Q (a n ) = deg Q (b n ) = deg K (a n ) = deg K (b n ) = 2 n " 1 ! for n > 1. 



C.3. The fields F(a n ), F(b n ), F(c n ), F(d n ), for F = Q,K. From identity (64), 



for all n > and F = Q or K. Identity (66), together with F(ao) = F(ai) = F 
and Corollary IC.1[ allows one to prove inductively that F(a n ) = F(b n ), for all n. We 
leave this to the reader. Using this and (!72|) . we get 



for all n > 0. Each of the extensions in f!72|) and f J73|) has degree 2, for n > 1, by 
Corollary EH 

Equations (69) and (71) yield two infinite towers of field extensions 




F(a n ) < F(a n+1 ) 



(73) 



F(b n ) < F(b n+1 ) 



K = K(di) < K(c 2 ) < K(d 3 ) <... 



THE DENSITY OF THE SET OF TRISECTABLE ANGLES 



39 



and 



K = K( Cl ) <K(d 2 ) <K(c 3 ) < 
where each extension has degree < 2. 

Lemma C.3. For all n>0, K(c n ) = K(d n ) = K(a n ) = K(b n ). 

Proof. We already have the last equality. The proof of the remaining equalities is by 
induction on n. The cases n = 0,1 are obvious, using the chart of computed values. 
Assume the result for n — 1. Since, K(a n ) = K(b n ), equations (67) and (68) imply 
that c n ,d n G K(a n ). Thus, we have 



Therefore, since [K(a n ) : K(a n ^i)] = 2, it suffices to show that c n ,d n G" K(a n _i). 
But, if c n G K(a n _i), then equation (70) would show that d n G K(a n -i). So, adding 
identities (67) and (68), we could conclude that a n G i^(a„_i), which is not possible. 
Therefore, c n $ K(a n -i). Similarly, d n ^ K(a n -i). This concludes the induction. □ 

Combining this with Corollary IC-H we get 

Corollary C.2. degK(c n ) = degxidn) = 2 n ~ 1 , for all n > 1. 

It remains to obtain a similar result for the field Q. 

Similarly to what we deduced above from equations (69) and (71), we have Q(c„_i) < 
Q(d n ) and Q(d n _i) < Q(c n ), for all n > 2. We now compute: 



and, similarly, Q(di) = K(di). Now, assume inductively that Q(c„_i) = K(c n _i) and 
Q(d n -i) = K(d n ^i). Then, we have 



fT(a n _i) = K(c n ^) = K(d n ^x) < K(c n ),K(d n ) < K(a n ). 



Q(c 1 ) = Q(V3)=K = K(c l ), 



Q(dn) = Q(c„-l)(d n ) 



K{ 



c n -i)(d n ) = K(d n ) 



and 



Q(c„) = Q(Ci)(c„) 



K(d n _ 1 )(c n )=K(c n ). 



Therefore, 




[Q(c„) : Q) 
[K{c n ) : Q] 
[K(a n ) : Q] 



[K(a n ):K][K:Q] 



Similarly for degq(d n ). 

This completes the proof of Proposition 5 
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